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ABSTRACT 

Large SZ selected galaxy cluster surveys are beginning imminently. We compare the 
dependence of the galaxy cluster angular power spectrum on cosmological parameters, 
different modeling assumptions and statistical observational errors. We quantify the de- 
generacies between theoretical assumptions such as the mass function and cosmological 
parameters such as cxs. We also identify a rough scaling behavior of this angular power 
spectrum with erg alone. 

Subject headings: cosmic microwave background — cosmological parameters — galaxies: 
clusters: general 

1. Introduction 

The Sunyaev Zel'dovich (hereafter SZ) effect is the upscattering of cosmic microwave back- 
ground photons by the hot electrons in galaxy clusters (Sunyaev & Zel'dovich (1972; 1980)). As 
the surface brightness of a cluster in the SZ is redshift independent, and the signal is relatively 
insensitive to the poorly known cluster core structure, the SZ effect is a powerful tool to select 
and study galaxy clusters. The field of SZ measurements has progressed rapidly from a handful 
of SZ detections for already-known galaxy clusters to SZ maps of several clusters to the current 
stage: cluster surveys designed exclusively for SZ detection. For reviews, see for example Birkin- 
shaw (1999) and Carlstrom, Holder & Reese (2002). Some surveys are already in progress such as 
ACBAR^ and SZA^. Others are on the verge of taking data, such as AMI^ and APEX"^, and ACT^ 
expects to be ready by the end of 2006. Thousands of previously unknown clusters will be observed 

^http://astrononiy.sussex. ac.uk/~romer/research/blind. html 
^http://astro. uchicago.edu/sza/overview. html 
'^http://www.mrao. cam. ac.uk/telescopes/ami/index. html 
^ ht tp : / /bolo . berkeley. edu / apexsz /index, html 
^http://www. hep. upenn.edu/'--^angelica/act/act. html 
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by these experiments, starting with APEX. Future surveys such as SPT^ and Planck''' are taking 
place within the next five years and should observe tens of thousands of clusters. For a full list of 
current and upcoming experiments, see for example the CMB experiment page at LAMBDA^. 

The prospect of combining these theoretical predictions and observational data on galaxy clus- 
ters to constrain fundamental cosmological parameters has led to much excitement (e.g. Holder, 
Haiman & Mohr (2001), Majumdar & Mohr (2003) and references below). With large survey data 
on the horizon, and the detailed design of later experiments taking shape, it is timely to identify 
which theoretical assumptions are the most crucial to pin down in order to use the science harvest 
from these experiments. The initial galaxy cluster information these surveys will produce will be 
number counts and the angular correlation function/cluster power spectrum. 

We consider here the angular power spectrum of clusters found by an SZ survey. We calculate 
the effect of cosmological, modeling and observational parameters on this power spectrum. The 
cosmological parameters we consider are Qm and erg. In the modeling, analytic predictions (based 
on the extremely good fit of analytic models to dark matter simulations) can predict the expected 
galaxy cluster counts, masses, and positions. These fits often differ at about the 10% level. In 
addition, cluster observations generally measure some proxy for the cluster mass (temperature. 
X-ray flux, SZ flux, velocities, shear in images). These proxies generally depend on more complex 
astrophysical properties (gasdynamics, for instance, or the state of relaxation of the cluster), and/or 
are difficult to connect cleanly to the mass (e.g. weak lensing masses, Metzler, White & Loken 
(2001)). We consider different analytic fits and assumptions for mass proxies. For the observational 
parameters we consider sensitivity and area of survey, using those of APEX for illustration. We 
quantify and compare the effects on the power spectrum of these uncertainties. Many of these 
uncertainties are degenerate, we identify which parameter changes are roughly equivalent, and 
which are not. 

The SZ properties of clusters in upcoming surveys are of great interest and many aspects have 
been studied in previous work. Constraints from the angular correlations of clusters found in the 
SZ have been studied by Moscardini et al (2002), Diaferio et al (2003), Mei & Bartlett (2003; 2004). 
These papers each varied a combination of different quantities (e.g. erg, normalization of SZ flux 
as function of mass, biases, scaling of SZ flux with mass and redshift). Majumdar & Mohr (2004) 
consider the two-dimensional power spectrum and Wang et al (2004) consider the three dimensional 
power spectrum in conjunction with other measurements. A similar quantity, the angular correlation 
of the SZ temperature spectrum, has been considered by Komatsu &, Kitayama (1999), Komatsu & 
Seljak (2002), and Majumdar & Mohr (? ). Battye & Weller (2003) studied similar systematics to 
those considered here, for number counts as a function of redshift, dN/dz. Uncertainties which were 



^http: / / astro.uchicago.edu/spt 

''http://astro.estec.esa.nl/SA-general/Projects/Planck 
^http://lanibda.gsfc. nasa.gov/ 
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treated separately in (different) previous works can be degenerate, leading to potentially misleading 
interpretations. We quantify the changes in cosmological parameters which mimic these other 
theoretical assumptions. 

In §2, we discuss the selection requirement for the SZ cluster catalogue, §3 defines the angular 
power spectrum and angular correlation function. Our new results are in §4, which considers 
the cosmological, modeling and observational uncertainties in the power spectrum and compares 
them. We discuss many of the uncertainties in detail, and then compare the changes caused by 
varying many different assumptions. We both find degeneracies and find which assumptions are not 
degenerate. Some of our assumptions have been considered separately in other works mentioned 
above (often for the correlation function rather than the power spectrum). By combining previously 
considered changes in assumptions and newer ones all together in a homogeneous manner and 
comparing to the same standard reference model we are able to quantify the relative importance 
of different theoretical assumptions in relation to cosmological parameters and to each other. §5 
concludes. Unless otherwise stated, we will take the Hubble constant tohe h — 0.7, baryon fraction 
Qft/i^ ^ 0.02, in a flat ACDM universe with = 0.3, Qa = 0.7. 



An SZ selected galaxy cluster catalogue is one that includes all clusters above a certain minimum 
SZ flux or (equivalently) above some minimum Y parameter Ymin- (The Y parameter is defined in 
detail below.) For a general review of the SZ effect, see for example Birkinshaw (1999), Rephaeli 
(1995a), and Sunyaev & Zel'dovich (1980). There are two SZ effects, thermal and kinetic. The 

(frequency dependent) thermal SZ effect is the change in the CMB spectrum due to random thermal 
motion of the intracluster electrons, and the (frequency independent) kinetic SZ effect is the change 
in the CMB spectrum due to bulk peculiar motions. The kinetic SZ effect is negligible compared to 
the thermal SZ effect for the purpose of cluster selection under study here; consequently we restrict 
our attention to the thermal SZ effect; "SZ effect" hereafter means thermal SZ effect. 

The thermal SZ effect can be described in terms of a CMB flux increment or decrement using 
the dimensionless Comptonization parameter y{9), 



where the prefactor5((a;) = {x^^—4:){l+5rei{x)) includes frequency dependence, x = hpU / {kBTcMB) — 
iz/BQ.SAGHz, and we will neglect the small relativistic correction 5rei, which gives only a few percent 
effect for the hottest clusters (Rephaeh (1995b), Itoh et al (1998), Nozawa et al. (2000), Fan & Wu 
(2003)). We will be interested in the integrated Y parameter. 



2. SZ selected galaxy cluster catalogues 



AT 



g{x)y{9) 



(1) 




(2) 
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where the integral is over the sohd angle subtended by the cluster. This can be written in terms 
of cluster properties as y{9) is the integration of pressure along the line of sight which passes at an 
angle 9 away from the center of the cluster: 

y{0) f dirien . (3) 

mgC^ J 

Here hp, (Tt, kB,me,ne,Te are respectively the Planck constant, Thomson cross section, Boltzmann 
constant, electron mass, intracluster electron density and temperature. The number of electrons 
along the hne of sight to the cluster mass is taken to be 



/ dndlrie = M^;^ (4) 



dy^ J dQdiue — 

where = TTz^ lo e^^-^'' ^(^) ~ •\/f^m(l + z)^ + ^a, Myir is the virial mass, fgas is the intra 
cluster gas fraction^ 

= 0.10/i-3/2M,W(l + O.lOMfsO-^S) (5) 

(Lin, Mohr and Stanford (2003), M15 = io^^-^Mq ) ' ~ 1.143 is the mean mass per electron and 
nip is the proton mass. Then the electron density weighted average temperature 

is given, using virialization arguments (see e.g. Battye & Weller (2003)) by 

where Ac (-2) = 187r^+82(fim(-2) — 1)— 39(r2m(-2) — 1)^- We have taken the 2; dependence from Pierpaoli 
et al (2003). Note this assumes that only electrons within the virial radius are contributing to the 
SZ effect. 

Although this form and a specific can be "derived" using virialization arguments, one can 
also just define as the constant of proportionality in the above. The above mass-temperature 
relation seems to work well for X-ray temperatures of high mass clusters and most measurements 
of for the above relation are done in the X-ray, we will call the X-ray value T^-'^°-y , Simulations 
find T^-'^°'y to be ~ 1.2 keV, while observations tend to prefer a higher values, T^-'^°'y 2.0 keV 
(for a recent compilation see, e.g., Huterer & White (2002), and for detailed discussion of subtleties 
in X-ray temperature definitions see, e.g. Mathiesen & Evrard (2001) and more recently Borgani 
et al (2004), Mazzotta et al (2004) and Rasia et al (2005) and references therein; convergence is 



^Note that for a Myir = lO^^h^^ Mq cluster wc get fgas = O-OGh^"^^^. The factor of h is included to make 
connection with other definitions, as Lin et al (2003) point out, the variation with h is not actually a simple scaling. 
We keep h fixed in this paper. Note that measurements of fgas implicitly require gas physics theoretical modeling. 



- 5 - 

improving steadily (Kravtsov (2005)) ). For the calculations here we will need T^^ . There is no a 
priori reason why the X-ray temperature and the SZ temperature normalizations are identical, as 
they get the bulk of their signal from different parts of the cluster. We discuss this in more detail 
later in section §4.2 when we consider modeling uncertainties. In addition, the above is in terms 
of Mmr\ several mass definitions are in use in the literature. If these differences are not taken into 
account correctly (White (2001)) via mass conversion, an apparent (but incorrect) change in T^^ 
will result, more discussion on this issue is in section §4.2. Combining these definitions and using 
again M15 = iois^-Tm^ ' 

Y ^ J J (MUeTe 

= 1.69 X 10^f,,,h^M',i'iAMzy)y^il - 2§^)(^MP£)2arcmin2 

The resulting SZ effect is a small distortion of the CMB of order ~ ImK. Results are often quoted 
in terms of fiux, with a conversion 

F, = 2.28 X 10 ^ x^—^ - 4 , mJy 

(e^ — 1)"' \ — 1 / arcmin 

For 143 GHz, the x dependent factor ~ —4 (which translates into Y — —AT/T for the Y parameter), 
for 90 GHz it is -3.3 and for 265 GHz it is -1-3.4. The SZ effect switches from a decrement to an 
increment in the CMB spectrum at 218 GHz. Thus, one way of distinguishing the thermal SZ effect 
from other sources, such as primary anisotropy or noise, is to see if it changes at 218 GHz. 

Specifics in going from this flux or corresponding Y value to a cluster detection depend upon 
the particulars of each experiment. We will consider the idealized case where an SZ experiment will 
detect all clusters above some minimum Y value, Ymin- Expcrimcnt-speciflc analysis and foUowup 
will be necessary to make reliable cluster identification. Interferometer experiments will "resolve 
out" some of the power and thus will effectively have a higher Ymm- In addition, false clusters 
detected due to alignments of low mass SZ sources will need to be discarded via some sort of follow 
up. The end result of this processing for an SZ cluster survey will be a catalogue of clusters (with 
angular positions) with SZ decrement above some minimum threshold value Ymin- 



^°One issue is the effect of beam size. For wide beams, confusion from point sources is a significant source of noise, 
White & Majumdar (2004), Knox et al (2003). For small beams, several pixels must be combined to produce the total 

cluster signal (see for example Battye & Weller (2003)) above the Ymin threshold, and there may be errors inherent 
to the corresponding cluster finding. These can be dealt with both in the data acquisition (e.g. by having more 
frequencies and an appropriate scanning strategy to help identify the point sources) and in the analysis; the effects 
particular to an experiment will depend strongly on the details of that experiment. An early example finding clusters 
in a noisy map was done by Schulz & White (2003), a more recent start-to- finish analysis of N-body simulations, 
including cluster finding and noise modeling, has been done for Planck SZ clusters by Geisbusch, Kneissl & Hobson 
(2004). See also MeUn, Bartlett & Delabrouille (2004), PierpaoU et al (2004), Vale & White (2005) for more on 
cluster finding. 
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We show in figure 1 a plot of the minimum cluster mass for a given l^jn as a function of 
redshift, for some representative Yj^in values expected with APEX and SPT. As mentioned above, 
the minimum mass depends on the mass temperature normalization Tf^ which is not well known 
and will be discussed in our section on modeling uncertainties, section §4.2. For illustration we have 
taken representative values for T^^^^-v from X-ray measurements, which we might expect to be close 
to T^^ . For Ymin we have taken the APEX's quoted 10 /iK sensitivity and multiplied by a factor of 
5, which would be a naive 5-a detection, fgas is taken to be O.lOh-^/'^M^i^'^^ /{I + O.lOMj^g^-^^) (Lin, 
Mohr & Stanford (2003)), in practice it also has a scatter. The slow change in Mmin with redshift 
is a feature of SZ selection, which in principle allows clusters of similar masses to be observed at 
all distances (Bartlett and Silk (1994), Barbosa et al (1996), Holder et al (2000), Bartlett (2001), 
Kneissl et al (2001), Diaferio et al (2003)). We can easily calculate observable quantities based on 
Mmin{Yiim, z) . This givcs an advantage over X-ray where the flux dims rapidly at higher redshifts. 
An additional advantage of SZ over X-ray is that the SZ signal strength depends upon the density, 
while the X-ray signal strength depends upon the density squared. Thus X-ray measurements boost 
the weight of the cluster core, which has poorly understood physics, in the detection. Conversely, 
the detection of SZ based on density means that the SZ signal is much more sensitive to line-of-sight 
contamination (White, Hernquist & Springel (2002)). Specifically, SZ effects are proportional to 
the total (hot) gas mass in the cluster along the line of sight (AT oc J UeTgasdl). In theoretical 
models the dominant contributions come from the region within ~ 0.2 — 0.4 of the cluster virial 
radius (Komatsu & Seljak (2002)). Note that at low redshifts the SZ selection probes very low mass 
objects, where the poorly understood gas physics dominates; as a result we will take a minimum 
redshift cut oi z — 0.2. Such a cut could be imposed experimentally in the foUowup. 



3. Analytic calculations 

In order to go from this cluster catalogue to cosmologically useful mass counts, theoretical 
processing and assumptions are needed. In this section we review and set the notation for the 
angular power spectrum/correlation function in terms of analytic quantities (which will be varied in 
the next section). Theoretical inputs to the analytic calculations include the choice of mass function, 
transfer function, biasing scheme, mass temperature relations (in particular T^^), the initial power 
spectrum and of course cosmological parameters. Some of these, such as the mass function, are well 
tested, other quantities such as Tf^, the mass temperature normahzation appropriate for calculating 
the Y parameter, are not determined well at all either theoretically or observationally, and widely 
varying approximations are in use. We will compare these approximations in the following. The 



The scatter found by Lin et al (2003) gives 

fgas = 0.10(±3%)/l-3/2M°5"8±27%/(^ ^ 0.10(±6%)Mf5°-25±28%) _ (^q) 
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Mass cut 




12 3 



z 



Fig. 1. — Minimum mass for given Ymin using equation 8. Note that as Ymin oc T^^ (Mmin)^ ' , 
fixed Ymin/T^^ will give the same Mmm- At 143 GHz, Y = —AT/T, at 150 GHz, corresponding to 
APEX, Y = — 0.96AT/T. Note that these are virial masses Myj^, M200 will be smaller by a factor 
of 0.77, see the discussion in section §4.2 about mass conversions. 



-8- 



angular power spectrum/correlation function can be calculated with analytic prescriptions for the 
mass function dn/dM{z) and the bias relating the dark matter correlations to the correlations for 
the galaxy clusters. The two dimensional correlation function is (Moscardini et al (2002), Diaferio 
et al (2003), Mei & Bartlett (2003)): 

r rldn Sr rldr, /-_(^_) dM, J-_^_^ dM,mu n)m2, r,)^^, M„ ^ - r.) 



w{e) 



1 {y^min ) 

(11) 

where ri,r2 are the radial distances of the clusters which have three dimensional positions ri,r2 
and ri ■ r2 = rir2 cos^. ^(Mi, M2, ri — r2) is the three dimensional cluster correlation function and 
(f){M,r) is the selection function (as a function of radial distance). In particular, 

0(Alr) = ^(^(r)) 

e(Mi, M2, ri - rs) = 6(Mi, z,{n))b{M2, ^2(r2))edm(ri - r^) 

Um{ri - r2) = Cdm{r) = J ^\ky^d\nk , r = |ri - raj (12) 

PUk) = D\z)Pok^T\k) 

In the above, the selection function is the number of clusters as a function of redshift (with the 
normalization included explicitly). The linear power spectrum Pun comes from an initial power 
spectrum with slope n, normalization Pq implied by our choice of cg, and transfer function T^(A;). 
The nonlinear power spectrum is derived from P^^ using the method of Smith et al (2003). The 
linear bias h is also defined above, nonlinear bias will be considered in the next section. 



The full two-dimensional correlation function thus becomes 

. . _ r rjdr, J- rjdr, J-„,„(^_) dM.^iAr,)) Jm^,„(^_) dM, ^ (.(r.))fe(Mi ,.i(n))fe(M„.,(r.))g,^(|n -..|) 

= r rMn) rldr,^r,)iam{\n - r,\) 

(13) 

where we have defined a generalized selection function (Moscardini et al (2002), Mei & Bartlett 
(2003)12) 

[r-VM..(...)^M^(.(r))dr] ^ ^ 

The above angular correlation function (and corresponding power spectrum) can be simplified 
via the Limber approximation (1953). As $ varies slowly relative to the correlation function, the 
integration can be rewritten as an integration over an average distance y = (ri + r2)/2 and one over 
relative separations x — {ri — r2)- The integration over x then gives a Bessel function Jq: 

w{0) J dyy'^yy J dxCdmiVW+^) .... 
^Jdyy'^yrjdlnknJoikye)^ ^ ' 



^^Note our selection function differs from that in Mei & Bartlett (2003) by the factor r^dr/dz, their expression for 
w{9) is equivalent. 
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(16) 



w{d) 



1 ■sr^m=+£ 
47r Z^m=-^ 



(17) 



because P£(cos^) ~ Jo{iO) for small angle. We can also define the inverse via 



/ 

^0 



'OO 



Ce = 27r 



w{e)Jo{ie)ede 



(18) 



The correlation function w{9) and its power spectrum Ce can be transformed to each other by the 
above equation, and therefore they encode the same information. However, to understand possible 
measurements and errors, use of Ce is usually preferable because the errors for different £ values are 
uncorrelated for small £. We'll use Ce for the most part in the following. 

For reference, we show in figure 2 the angular power spectrum for a representative model. The 
dependence on the possible reasonable choices for the analytic and cosmological model parameters 
is the subject of a later section. The choices taken here will be our "vanilla" model: for the dark 
matter we use the Evrard mass function, the Sheth-Tormen bias and the Eisenstein-Hu transfer 
function, for cluster parameters we take Tf^ = 1.2, fgas = 0.10h~^/^M^^^^^ /{I + O.lOMfg^-^^), for 
cosmological parameters we take cxg = 0.9, n = 1, Qm = 0.3, Qbh"^ = 0.02 and for experimental 
parameters we take F^m = 1-7 x 10~^. Our minimum redshift is Zmin = 0.2. We will use the same 
axes for (almost) all the plots, so that the relative impact of different effects are easily comparable. 
For the "vanilla" parameters one gets about 10 clusters per square degree. 

We have made some changes from earlier similar works in this review section. The basic 
expression above has been used by Mei and Bartlctt (2003) and a variant (to be discussed later) 
has been derived and used by Diaferio et al (2003). Figure 2 has the following changes from 
this earlier work: it gives the power spectrum rather than the correlation function, implements 
mass conversions (which can change masses by 30%), and includes a 10% scatter in the relation 
of minimum mass and due to the cluster parameter uncertainties. Mass conversions and the 
origin for the amount of scatter we have chosen are discussed in §4. We also differ Mei and Bartlett 
in using the Eisenstein-Hu transfer function. 



^"^Note that this is not identical to the "vanilla" model of Tegmark et al(2004), wc merely use the term to denote 
a model which we take to be the simplest in some rough sense. For example, our "vanilla" model includes a choice 
of and of mass function. 



'Vanilla" model 
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Fig. 2. — Our reference "vanilla" model. The Evrard mass function, Sheth-Tormen bias, Eisentein 
Hu transfer function, Tf^ = 1.2, fgas = 0.10h-^/^M^^^'^y{l + ClOMfg^-^^), as = 0.9, n = 1,1]„, = 
0.3, r^fc/i^ = 0.02 and for experimental parameters i^rnm = 1-7 x 10~^ and minimum redshift z^m = 
0.2. Mass conversions have been taken into account with the prescription given in White (2001). 
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4. Uncertainties 

We now compare the effects of cosmological, modeling, and observational uncertainties/unknowns 
on the cluster angular power spectrum. Some aspects of these, with different assumptions, have 
been considered previously for the cluster angular correlation function: Diaferio et al (2003) con- 
sider two cosmological models and vary and the bias, Mei and Bartlett (2003; 2004) vary F^m; 
CTg, ilm and T^^. For the spatial power spectrum in a Fisher matrix analysis, Wang et al (2004) 
vary these and the primordial fluctuation spectrum, the dark energy density and equation of state, 
the baryon density. Both they and Mei and Bartlett (2003; 2004) allow extra (l + z) and M factors 
to appear in the Y{M) relation. Mei and Bartlett find a small effect, note they are at (large relative 
to our work here) values of the Y parameter, corresponding to large masses where the gas physics 
isn't as important. Majumdar & Mohr (2003; 2004) vary most of these factors as well in finding 
their constraints. Except for the Mei & Bartlett (2004) paper which considers APEX, these other 
experiments are primarily concerned with far future experiments such as SPT. 

In this work we consider other theoretical uncertainties such as changing the mass function and 
the gas fraction. Another new aspect of this work is comparing all these recognized uncertainties to 
each other, which helps identify which modeling and observational uncertainties need to be reduced 
the most. In addition, as mentioned earlier, we are primarily concerned with the angular power 
spectrum as in the last two works, rather than the correlation function. Unlike the angular power 
spectrum in the Gaussian regime, the errors in the correlation function w{9) are correlated which 
makes it more difficult to understand how well measurements at given separations can determine 
various quantities. 



4.1. Cosmological model dependence 

We start by showing what changes to the cosmological model do to the "vanilla" model, for later 
comparison with the modeling and experimental uncertainties. For instance, the current published 
joint WMAP/SDSS cosmological parameters and errors are Qrn = 0.30 ± 0.04, as = O.SGloin 
(Tegmark et al (2004)). We compare our "vanilla" model with changes in and as, in figure 

3. The correlation function decreases with increasing ag, as a higher a^ means the generalized 
selection function broadens and moves its peak to higher redshift^^. The broadening gives more 
non-correlated clusters nearby any given cluster, in addition the biasing is weaker for high os}^ The 
cluster power spectrum scales quite differently with as than the temperature correlation function. 
If one parameterizes Ci as 

Q{as)^ar^'^m, (19) 

^^Mei & Bartlett (2003) have illustrated this effect in their paper. 
^^We thank the referee for emphasizing the second point. 
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Cosmological parameter dependence 




Fig. 3. — The dependence of the power spectrum on varying ag and fi^ separately. The sohd hne 
is our reference vanilla model with erg = 0.9, Qm = 0.3. The long dashed line is for erg = 0.8, the 
short-long dashed line is for erg = 1.0 and the dotted line is for = 0.25. 
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for Ymin — 1.7 X 10~^ and as — 0.9, a{£) ranges from around 2.7 for i ~ 100 to about 1.6 for 
£ ~ 6000. Doubling Yjnin changes this range to 3.5 and 1.7 respectively (note the negative power of 
(Tg which is different from the positive power scaling behavior of the SZ temperature power spectrum 
(Komatsu & Seljak (2001), Sadeh & Raphaeh (2004)). 

In fact, the effect of changing as is somewhat smaller, as there are other constraints which 
must be satisfied when changing as, for instance the observed and hence fixed number counts of 
clusters. One can include this constraint by requiring the number of clusters to remain fixed along 
with fl^ for instance, and find the required scaling between T^^ and erg (Sadeh & Rephaeli (2004)). 
Allowing Qrn to vary still gives a constraint using identical arguments to Huterer & White (2002) 
(also see Evrard et al (2002)) who were considering the X-ray temperature: Fixing the number of 
clusters with a given Y parameter (an observable) and allowing T^^ , as and flm to vary gives the 
following (directly analogous to X-ray) scaling relation, 

- i'^s^'^r'-' ■ (20) 

More precisely it will be fgasT^^ on the left. The modeling parameter T^^ will be discussed in 
detail in the next section. This relation already suggests a strong degeneracy between the effects of 
changing as, Tf^, Q^, see section §4.3 for more discussion on degeneracies. 



4.2. Modeling Uncertainties 

There are several parameters and functions that go into the theoretical predictions besides the 
cosmological parameters. These can be divided into those related to cluster properties independent 
of the SZ effect, and those related to the transformation from cluster mass to the observable Y 
(equation 8). We will treat both of these in turn. For an SZ selected survey, uncertainties in 
modeling affect results by bringing objects into or out of the survey. As a result, uncertainties 
that have a large effect on the observational properties of the lowest mass clusters in the survey 
{Myir ~ lO^^h'^Mo,) have the most impact. 

Dark matter cluster properties: The cluster properties independent of the SZ effect used 
in the analytic prediction of Q, equation 16, are the correlation function of the dark matter, the 
mass function, and the bias. 

Dark Matter correlation function: The dark matter correlation function is quite well known, 
and can be obtained from the initial power spectrum via a transfer function (such as BBKS (1986) 
or Eisenstein Sz Hu (1997)) and then by implementing a nonlinear power spectrum prescription 
(such as Peacock and Dodds (1996) or Smith et al (2003)). The vanilla model uses the more 
accurate recent fits, i.e., the nonlinear power spectrum fit by Smith et al and the Eisenstein & Hu 
transfer function (the latter is within 3% of the exact CMB power spectrum, M. White, private 
communication). We compare the case with the BBKS transfer function to the vanilla model, as it 
is still in use by many groups. 
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Mass conversions: Before defining the mass functions, it should be recalled that there are 
several different mass definitions in use. For SZ calculations, the mass-temperature relation usually 
involves the virial mass, but the popular mass functions usually are instead for some linking length 
or parameter A such that the mass inside a radius ta is A times the critical density 

Ma = — ApeHtri = / Anr^drp{r) . (21) 

This mass can differ significantly from the virial mass (White (2001)), which enters into the definition 
of y, and conversion must be made between the virial mass and the mass appearing in the mass 
function. The way suggested in White (2001) is to start by assuming an NFW (Navarro, Prenk and 
White (1997)) mass profile, a fitting formula for this method is given in Hu & Kravtsov (2003). 
For instance, in an = 0.3 universe, Myir/M2oo ~ 1.3 for a cluster with concentration 5. A 30% 
difference in mass is significant when one is attempt to make percent-level predictions! Masses enter 
the calculations in the Y{M) relation, the mass function and the bias; consistent definitions or the 
relevant conversions are needed. We show in figure 5 the effect on the vanilla model from using 
M200 rather than the appropriate masses in the Y[M) relation and bias. 

Mass Functions: We compare the effects on the angular power spectrum of the above two 
possibilities, using a less accurate transfer function and neglecting the mass conversion, and three 
commonly used mass functions in figures 4 and 5. 

The three mass functions taken here can be viewed as generalizations of the heuristic mass 
function of Press and Schechter (1974). These are based upon simulations with large enough 
volume to accurately sample the number of rare objects such as galaxy clusters: the Jenkins et al 
(2001) mass function, the Sheth-Tormen (1999) mass function and the Evrard et al (2002) mass 
function. The first two are for masses in terms of Migof^^, while the last is in terms of M2oo- These 
mass functions are expressed in terms oi a{M,z) or iy{M,z). Here a{M,z) is the rms of the mass 
density field smoothed on a scale R — (3M/(47rpb))^/^ with a top hat window function, cr{M, z) — 
Jq°° ^lS.^{k)Wlj{k, z) where Wl^{k,z) is the Fourier transform of window function, v — 5c/(t, and 
Pi, = 0,jnPcrit- We ignore the weak cosmological dependence of Sc and set Sc = 1.686. We then can 
write 

dM^niT^^^"'" ^^iw-' ^^^^ 

to get the three mass functions described in Table 1. 

The mass density field and the correlation function are derived from the primordial power 
spectrum, which we take to be scale free, i.e. n = 1, and normalized by cxg. To get a sense of the 
effects, the (less accurate) earher BBKS (1986) transfer function decreases Q by 5% at ^ = 214, 
Neglecting the mass conversion between M200 and Myj^ decreases C214 by 11% and the Sheth- 
Tormen mass function (1999) decreases it by 19%. Using the Peacock & Dodds (1996) nonlinear 



^^Note that some people use A to refer to the density relative to the mean density. 
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Mass functions 




1016 



Fig. 4. — Evrard et al, Jenkins et al and Sheth-Tormen mass functions. The smooth hne is the 
Evrard mass function, the dotted hne is the Jenkins mass function and the dashed hne is the 
Sheth-Tormen mass function. 



Mass function dependence 




Fig. 5. — On left: The angular correlation function, same models and line labeling as figure 4, 
plus the vanilla model except for BBKS transfer function (dot-dashed line) and vanilla model 
without using mass function conversion for mass-temperature (heavy dotted line). On right: the 
corresponding angular power spectra. 



Table 1: 



mass function 






Sheth-Tormen 


180pb 


0-364(1 + o.811aO-6)e-^ 


Jenkins 


180 pb 


0.301 exp[- 


|ln(T-i + 0.64 


3.821 


Evrard et al 


200 pcrit 


0.22exp[-| 


Inff-i + 0.73I 


3.86j 
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prescription gives no noticeable change, hence we did not show it in the figure. An extensive 
comparison of different cases is given in the table in section §4.3. 

Bias: There are also several different possibilities for bias. The (linear) bias b{M, z) is defined 

via 

i{M,M,r,z)^h\M,z)i{r,z) (23) 

where ^{r, z) is the dark matter power spectrum and ^(M, M, r, z) is the power spectrum of halos 
of mass M. The original idea of peak biasing by Kaiser (1984) has been improved upon with fits to 
simulations. There is the Sheth-Tormen bias (1999), fit to simulations and motivated by a moving 
wall argument, the bias found by Sheth, Mo & Tormen (2001) (SMT), and the bias more recently 
found by Seljak and Warren (2004). The Seljak and Warren bias was found for small masses but 
has the best statistics currently available. It overlaps closely with the Sheth-Tormen bias where it 
is valid but is systematically lower and does not extend very far into the high mass range needed for 
clusters. Thus, using a combination of the two biases would result in a bias which doesn't integrate 
to one when combined with the Sheth-Tormen mass function. Consequently we have taken the 
Sheth-Tormen bias as our default. 



Table 2: 



bias function 


^Pcrit 


bias 


Sheth-Tormen 


180p6 


y> = ^0.7076c/(r = 1.418/a, p = 0.3 


Sheth, Mo & Tormen 


ISOpb 


1_L 1 , /2(l-c) i^'^VV^ 

a = 0.707, 6 = 0.5, c = 0.6 


Seljak & Warren 


imperii 


0.53 + 0.39x0-42 ^ 0.08 ^ 10-43.1.7 

40x+l 

X = M/Mni M = Mj,i^ u = a 



For the SZ selected power spectrum one integrates over all masses greater than some 
SO that what one is actually probing is an integral of the bias, i.e. $(y^j„, z) in equation 14. One 
can define a related (rescaled by the number density) quantity: 



/M_(y..)^^^(-(^))^--...(M,.(r)) 



-'method,ef f 



(24) 



where bmethod is one of the above biases. The linear biasing prescription above doesn't work as 
well for short distances, and for this a "scale dependent bias" has been calculated for the cluster 
correlation functions by Hamana et al (2001) and Diafcrio ct al (2003). This scale dependent bias 
is also a function of the separation r of the objects of interest and for Diaferio et al is 

beff{r, Z, Ymin) = bsT,eff{z, Ymin){l + bsT,eff{z, Ymin)cr{r, z))^'^^ (25) 



the corresponding expression for Hamana et al has an exponent 0.15. Diaferio et al have shown that 
this bias works well for cluster correlation functions for a range of redshifts. We will use the Diaferio 



-18- 



et al case for illustration. The bias of Hamana et al is midway between the linear biasing case and 
the Diaferio et al case (2003). With the Diaferio et al bias, the correlation function becomes 

^(^) = ir rlHn) /o°° rjdr2^r2)^dm{\ri - rsl) (26) 

where 

Idmdri - r2|) = (1 + bsT,eff(z, Ymin)o{T, z))°-™edm(|ri - r2|) (27) 

In the Limber approximation one then finds 

w{Q) = y dyy'^{yf J dx{l + bsT,effiz, Y^^)a{^yW + x\ z)f'\am{^fW^^) (28) 

As this doesn't easily allow a rewriting in terms of Jo{£9), obtaining the power spectrum C^'s is 
somewhat more difficult. In figure 6 we show the correlation function w{6) at left and the power 
spectrum Q at right for the vanilla model and then the Sheth, Mo & Tormen linear bias and the 
Diaferio et al nonlinear bias. (Although the Hamana et al bias is also between the Diaferio et al bias 
and the vanilla bias, it is not what is shown in this figure.) We transformed the difference of w{9ys 
between the vanilla model and Diaferio et al model to get the C(S for the former. We use the vanilla 
model parameters for the Diaferio et al bias (e.g. Tf^ = 1.2 rather than Tf^ = 2.0 as they did). 
The nonlinear bias has the strongest effect at short distances in the correlation function-its effect 
is strongly localized around ~ 0. Therefore its Fourier transform, the angular power spectrum, 
has additional contributions of almost constant magnitude at all i relative to the vanilla model. 
(The limit of this would be adding power at only ^ = 0, which translates into adding a constant 
to the power spectrum.) Some of this difference is deceptive as the errors for Ce are independent 
in the linear regime, while those for w{9) combine the w{9) measurements for different values of 9. 
For a visual comparison, the independence of the errors for Q makes it easier to draw conclusions 
about relations between different parameter choices and uncertainties. Also, even though the power 
spectrum and the correlation function are Fourier transform pairs, estimators for these differ in 
practice when real data is in hand, and give different information when one does not have full 27r 
angular coverage in 9. Ideally one would use both given real data. 

There is also intrinsic scatter around the bias and the mass functions. We will not put this 
in explicitly, however the weak dependence on the scatter in the Y[M) relation (mentioned below) 
leads us to suspect it will not be a large effect. 

Y parameter: The next step is relating the cluster mass to a y parameter, which involves 
more complicated gas physics. In addition, the increased sensitivities of upcoming experiments will 
allow smaller and smaller mass clusters to be detected, which are more and more easily disrupted 
by this gas physics. 

There are actually three questions: the actual form of the mass temperature/y parameter 
relation, the normalization of this relation (i.e. T^^ fgas) and the scatter around this normalization 
for a representative group of galaxy clusters. Simulations alone cannot determine these: the heating 
and cooling properties of clusters are not understood at an accuracy needed for precision cosmology 



Bias dependence 




Fig. 6. — The effect of different bias prescriptions on tlie correlation function w{6) (left) and the 
power spectrum Ce (right). The vanilla reference model is shown in both cases (solid line), as well 
as the cases with the Sheth Mo & Tormen bias in equation 2 (dotted line) and the nonlinear bias 
of Diaferio et al, equation 25 (dashed hne). 
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and so these questions are intermingled by assumptions used. For instance, the scahng relation 
was obtained by assuming an isothermal gas profile. Assuming hydrostatic equilibrium and using 
the total Y parameter means that the details of the profile (many others have been suggested, 
e.g. Komatsu & Seljak (2001), Loken et al (2002)) get absorbed into the mass temperature (or 

Y parameter) normalization or form. If the parameters of the gas profile change with cosmology, 
it's possible that the normalization will also do so, or rather the form of the Y{M) relation, a 
possibility which will need to be checked for carefully in the data. We consider the form of the mass 
temperature/y parameter relation, the normalization, and the scatter in turn. 

Mass temperature/Y parameter relation: There are several different mass temperature relations 
in the literature (see Sadeh & Rephaeli (2004) for a description of five common ones), usually based 
on X-ray mass temperature relations. These relations have been tested both with observations and 
simulations (bear in mind again that the simulations do not seem to yet have all the necessary 
physics), and some generalizations of these relations have also been tested. For instance, equation 
8 can be generalized to include different z and M dependence, such as multiplying by a factor 
(1 + z^M"'. The mass-Y parameter relation also has dependence on fgas which can be generalized 
to change fgas with redshift, or change it differently with mass. Observational data and simulation 
data have been used to search for these effects. 

Most observational tests are of the X-ray mass temperature relations and of the change of fgas 
with mass or redshift, rather than of the Y{M) relation. For example, Ettori et al (2004a) have 
found no evidence for additional evolution in redshift of the mass temperature relation. For scaling of 
mass with temperature, Ettori et al (2004b) and Ota & Mitsuda (2004) find T^^^™*' ~ M^/^-M^/^, 
but the departure from the M^/^ relation is about l.Scr for the Ettori et al data and is marginal 
given the error bars for the Ota & Mitsuda data. The former group also finds marginal (less than 
2a) evidence for clusters of a fixed temperature to have smaller gas mass at high redshift. We have 
already included the fgas dependence with mass found by Lin et al (2003), but have not included 
any redshift dependence. 

More simulations than observations have addressed the z and M dependences of the Y{M) 
relation directly. For example da Silva et al (2004) find numerically that the z dependence seems 
to be well represented by the simple scaling given in equation 8. For lower mass objects da Silva 
et al (2004) find that the M dependence in the Y{M) relation steepens from M^/^. Our use of an 
M-dependent fgas produces an effect in the same direction. However, for clusters with M200 above 
5 X IO^'^'H^^Mq (or Mmr > 6.5 x 10^^h~^MQ), one finds that this can just be absorbed into scatter of 
about 10 % around the M^/^ scaling (White et al (2002)) in the Y(M) relation. The effects of adding 
an additional factor of {1 + zY'M^ to Y{M) has been considered for the angular correlation function 
by Moscardini et al (2002) and Mei & Bartlett (2003), and Wang et al (2004) and Majumdar & 
Mohr (2003; 2004) have considered the effects of this additional scaling on parameter estimation 
from the the power spectrum (the three dimensional one in the former case). 

Normalization ofY{M) relation: For normalization, we have combined all the mass temper- 
ature conversion ignorance into the parameter combination Y oc T^^ fgas- The simplest procedure 
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would be to say that Y — Y^r (and that gas outside this radius does not contribute significantly) 
and to take Tf^ to be the X-ray value, 

rpSZ — rpX—ray 
-'- * — -'- * 

As noted in section §2, there is strong disagreement between simulations and observations for 
T^-'^"'y. Thus the parameter T^-'^^-v is not well determined. Even if it were, using it to normahze the 
y(M) relation is not necessarily justified. Not only do X-ray measurements weight more strongly 

the center of the cluster, as mentioned before, but for the SZ effect the normalization has an 
additional contribution due to line of sight contamination from gas outside the cluster. If one 
takes simulation results (which should be taken with a grain of salt given the above mentioned 
discrepancy), this projection effect on Y{M) raises the normalization about 8% (White et al (2002)) 
above the normalization due to the cluster alone. 

The differences between power spectra for different normalizations of the Y{M) relation are 
shown in figure 7. We also show that taking fgas fixed at 0.06 (the value for a cluster with 
Myir = lO^'^/i~^M0) gives a power spectrum very close to the vanilla model. And we additionally 
have shown a model with T^^ — 2.2keV which has the 8% increase from line of sight projection 
(fromTf^ = 2keV). Of course, as changing T^'^ fgas just rescales the Y parameter, raising T^^ fg^s 
is equivalent to lowering y^j„, i.e. one is probing clusters with smaller mass. 

Scatter: Unlike the normalization, the effect of scatter on the Y{M) relation was very weak. 
Taking a 10-12% intrinsic scatter in the mass temperature relation (from Evrard, Metzler & Navarro 
(1996) for X-ray simulations and similarly from White et al (2002) for SZ simulations) had less than 
a percent effect on the C^, even with APEX sensitivity and accompanying very low mass cuts. For 
a large scatter of about 30%, was roughly decreased by about 3%. A similar robustness to M-T 
scatter was found in the Fisher matrix calculations (Levine, Schulz & White (2002)) and in that 
of number counts (Battye & Weller (2003)). Metzler (1998) also found from simulations that the 
scatter in the mass temperature relation was larger than that for the mass-Y parameter relation. 
Thus for the bulk of the paper, we have used equation 8 and combined all our ignorance into the 
parameters T^^ fgas- We included the unnoticeable 10% scatter in the Y(M) relation in all our 
calculations here. 

We were concerned about sources of scatter which are not included in our analytic description, 
mergers in particular. One might expect that processes such as merging will disrupt the clusters 
and thus invalidate the assumption of virialization used in some analytic calculations. The most 
massive clusters have the most recent mergers (as they are generally the most recently formed 
objects), but these tend to be included automatically in the catalogue as their estimated masses, 
even if inaccurate, are quite high relative to the mass cut. The selection for the catalogue depends 
most sensitively on the least massive clusters included, where mergers are relatively rarer. (However, 
at high redshift the "low mass" clusters are recently formed as they are the most massive collapsed 
objects at that time, so one might expect some effect from them.) Mergers are automatically 
included in the cosmological simulations and the scatter in the Y — M relation is not larger than 
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Y(M) and mass-temperature normalization dependence 




Fig. 7. — The power spectrum for the vanilla model, with T^^ = 1.2keV (solid line), and that 
with Tf^ = 2.0keV (dotted line) as might be suggested by X-ray measurements. A higher value, 
T^^ = 2.2keV (dashed line) is also shown to give an idea of the change that an expected SZ line of 
sight projection effect (around 8%) would give. The dot-dashed line fixes T^^ = 1.2keV but has no 
evolution in fgas with mass, i.e. fgas = O.OQh~^^'^, the value for M^ir = 10^^h~^MQ. 
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that expected due to scatter in the M — T relation (White et al (2002)), leading us to expect 
that merger induced scatter is relatively small. However, as simulations cannot reproduce cluster 
properties precisely yet, observational data will be needed to calibrate this effect. 

4.3. Degeneracies 

We have considered several different choices for theoretical inputs to the calculations of the 
angular power spectrum. With a brief visual inspection it can be seen that there are many de- 
generacies, i.e. that several changes to parameters and modeling seem to have the same effect on 
the power spectrum.^''' Note that this does not take into account other constraints at the same 
time, such as fixed number counts, only one parameter is varied at a time. We show some of these 
degeneracies more explicitly in figure 8, there are other degeneracies in the parameters considered 
here that aren't shown. For instance, choosing a constant fgas rather than having evolution with 
mass, as done in the previous subsection, is degenerate with changing Tf^ from 1.2 keV to 1.3 kcV, 
using the Sheth-Tormen mass function rather than the Evrard mass function is degenerate with 
changing Tf^ from 1.2 keV to 1.3 keV, and neglecting the mass conversion from M200 to Myj^ and 
-^isopi, in the Y{M) relation and bias respectively is roughly degenerate rescahng Q overall by a 
factor of 0.88. 

This can be compared to changing Q^n oi using the nonlinear bias of Diaferio et al, or using 
a less accurate transfer function such as BBKS, all of which change the "shape" of Q differently 
than those above. These are shown on the right of figure 8. 

Values of Ci for these models arc compared quantitatively in the table 3. 

The fact that Qm is not as degenerate with erg as expected shows the limitations of the rough 
scaling estimate made in section §4.1. Some of these degeneracies can be broken by other measure- 
ments, for instance dN/dz, however the degeneracies are very similar, except for the bias. The bias 
has no effect on dN/dz and thus can be taken out easily. In figure 9, we show dN/dz for the same 
models considered for figure 8 (note that we are considering a full steradian). As we have shown 
how choices like the mass function and different biases are degenerate with other parameter choices 
(in most cases), one is only as well measured as the other is known. In order to see how close these 
power spectra are in practice, we now compare to the inherent statistical measurement error. 

4.4. Cosmic variance, sample variance and shot noise 

There are three sources of inherent statistical measurement error for the power spectrum Q in 
the absence of any systematic errors: shot noise, cosmic variance and sample variance. These can 



^''We thank the referee for encouraging us to add more discussion on this issue. 



Degeneracies 




Fig. 8. — Degeneracies in theoretical modeling: On left, the center line is "vanilla" model, the 3 lower 
lines correspond to as = 0.97 (dotted), Jenkins (2001) mass function (dot-dash), and T^^ = 1.7keV 
(dashed), and the 3 upper lines correspond to as = 0.83 (dotted), Sheth-Mo-Tormen (2001) bias 
(dot-dash), and Tf^ = 0.9keV (dashed). On right, models which are not as degenerate. The solid 
line is "vanilla" model, the lower dot-dashed line is the (less accurate) BBKS transfer function, the 
upper dotted line is flm = 0.25 and the top dashed line is the Diaferio et al nonlinear bias. 



Table 3: 



Model 


C214/10-6 


Cioi9/10-^ 


C4132/IO 


vanilla 


4.75 


6.67 


1.00 


= 0.9keV 


5.58 


8.04 


1.18 


as = 0.83 


5.80 


8.04 


1.17 


SMT bias 


5.75 


8.10 


1.21 


T, = 1.7keV 


3.93 


5.37 


0.82 


as = 0.97 


3.98 


5.66 


0.87 


ST mass 


3.87 


4.89 


0.79 


Jenkins mass 


4.13 


5.42 


0.86 


no fgas evol 


4.44 


6.25 


0. 93 


no M conversion 


4.16 


5.89 


0.88 


BBKS 


4.74 


6.67 


1.00 


Diaferio bias 


9.46 


1.91 


4.28 


a„ = 0.25 


5.60 


7.56 


1.06 




Fig. 9. — Degeneracies in theoretical modeling for dN/dz: labeling is the same as figure 7 left, 
except that the upper and lower groups of lines have switched. In addition, changing the bias to 
that of Sheth-Mo-Tormen has no effect on dN/ dz and so that line is now degenerate with the vanilla 
case. 
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be combined to give the standard expression for overall error (see, for example, Knox (1995)): 



The factor of 2£+ 1 is due to the 2£+l independent measurements of the power for any £. For small 
i (large scales) there are very few independent measurements of power in the sky, which is dubbed 
cosmic variance. Sample variance increases the error as the sky coverage fsky < 1 decreases. Shot 
noise is determined by n, the number density of clusters per steradian. 

As the depth of the survey goes up (y^m decreasing), the power spectrum also decreases, as 
there are more and more clusters of lower and lower mass, and these are less correlated. However, 
the shot noise also goes down. On the other hand, as the depth of the survey decreases (l^m 
increasing), the power spectrum becomes restricted to higher and higher mass objects and thus 
goes up. However as these objects are rarer, the shot noise also increases. We can compare the 
errors quantities for 3 examples, APEX, SPT, and Planck. 

We use the vanilla model and again take y^m = 1.7 x 10~^ corresponding to a 5a detection for 
APEX with ST ~ 10//K (at 150 GHz). APEX will survey 100/200 square degrees at two frequencies 
(214 and 150 GHz, corresponding to 1.4 and 2 mm wavelengths), with 0.75' resolution. For SPT, we 
expect about 4000 deg^ and similar sensitivity and resolution. For Planck, wc have all sky and will 
take a rough estimate of Ymin = 10^^ (sensitivity/resolution ranging from 5fiK at 7.0' to 50/iK at 
33.0', depending upon frequency). First we plot the errors due to shot noise, sampling and cosmic 
variance for the Planck specifications above in figure 10. 

In comparison, those for APEX and SPT arc shown in figure 11. The vanilla model has the 
same parameters as earlier (including Ymin = 1-7 x 10^^), and error bars are shown for 100, 200, 
and 4000 square degrees, representative areas for data sets expected from APEX and SPT. The 
largest error bars are for the smallest area. The plot on the upper right has the same observing 
area, but with a minimum Yj^in value of 10~^. This might occur if e.g. Tf^ were 2.0 keV rather 
than 1.2, a reasonable possibility given X-ray measurements and line of sight contamination effects. 
The sensitivity is directly proportional to K„ii„, so "in principle" an experiment can fix area/Y^^^ 
for a given observing time, producing a tradeoff between wide and shallow or narrow and deep. The 
errors given in equation 30 are shown for the vanilla model correlation function and then compared 
to other possibilities with fixed observing time. Note this ignores how the efficiency and difficulty 
of cluster identification changes with Ymin- The errors also neglect the Poisson nature of the shot 
noise and the 3 and 4 point functions of the clusters, thus we are caUing them "naive" . The bottom 
two graphs are the vanilla model plotted with a smaller y^j„ with error bars corresponding to the 
100 and 200 square degree surveys with area up by the factor of 4 or 16 for the left and right graphs 




(30) 



^^Here the shot noise is considered in the Gaussian limit, for the full Poisson errors for the shot noise, which can be 
important, see Cohn (2005). In addition, there arc corrections to the error due to the three and four point functions 
of galaxy clusters which are usually not included and we do not include them here. 
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Planck naive errors 




Fig. 10. — The angular power spectrum with errors from shot noise, samphng and cosmic variance 
error expected from Planck, for a y^j„ = 10~^. The bin size is the spacing between the error bars, 
equal spacing in log I. See text for more details. 
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Area vs. sensitivity 




Fig. 11. — The angular power spectrum with errors from shot noise, samphng and cosmic variance 
error. Error bars are for 100, 200 and 4000 square degrees for the top two plots (vanilla model at 
left, Ymin = 10~^ at right), with the largest error bars corresponding to the smallest area. The lower 
figures correspond to the vanilla model except for the change Ymin '2Ymin,vaniiia {4Ymin,vaniiia) on 
the left (right), with the area going up by a factor of 4 (16), relative to the vanilla model at upper 
left. This naively keeps the observing time fixed. The error bars are equally spaced in log i. See 
the text for more details. 
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respectively. Note that as the sensitivity goes down that the power spectrum and the shot noise go 
up, as mentioned earher-only rarer and thus more clustered objects are included. The bin size is the 
spacing between the error bars, equal in log I. However, the decrease in error bars due to increased 
sky coverage outweighs the increase in shot noise: shallow and wide appears naively preferable to 
deep and narrow. 

In fact, poorer sensitivity is not necessarily a bad thing, as the omitted clusters have low mass 
and thus the strongest dependence on the poorly understood gas physics. Exactly how deep is the 
most profitable will depend very strongly on the normalization of the y(M) relation. In addition, 
the corresponding changes in the purity and completeness of the surveys have not been taken into 
account in this scaling argument, how these properties scale will depend upon the cluster finding 
algorithms in use as well. Fixing the sensitivity and increasing the area, as SPT will do, is of course 
a major improvement in any case. 

In addition, even though the error bars are quite large, the measurement contains useful infor- 
mation. Theory gives smooth and known functions of £, C^, so we can bin Ni nearby values of £, 
reducing the error. For example, we can use the combined shape of the power spectrum (20 bins 
equally spaced in log t) and the errors in equation (30) to find one sigma contours in the ag — ilm 
plane. This is shown in figure 12 for marginahzing over a 10% prior and a 30% prior on T^^ 
and holding all the other uncertainties fixed for our "vanilla" model. These error ellipses clearly 
illustrate the degeneracies inherent in the effects of changing these two cosmological parameters for 
the angular power spectrum.^'' The closest error analysis to ours was done by Mci and Bartlett 
(2004) who used the measurement of w{9) at 30 arcminutes and counts at the flux limit to get error 
contours for ag and Vim of comparable size. Here instead we use the full power spectrum because our 
intent is to illustrate the degeneracies in using the power spectrum to constrain these parameters. 
As Mei and Bartlctt's degeneracy line in ^2^, ag for w(30') differs from the ellipse axes in figure 12, 
the two sets of constraints are complementary. 



^^Modifications of this to include Poisson shot noise (Cohn (2005)) appear to reduce the benefits of the shallower 

surveys, but the exact comparison will require more assumptions about which quantity is of interest, as well as 
inclusion of 3 and 4 point functions and an estimate of when the sampling goes from Poisson to sub-Poisson (Casas- 
Miranda et al(2002), Sheth & Lemson (1999)). 

^°The more precise full Poisson errors (Cohn (2005)) will enlarge these error ellipses by 25-37 % along the long 
axis and by at most 5% along the short axis, and rotate them by a small amount. This more precise calculation 
still has the other assumptions used throughout this paper however: perfect cluster finding and no inclusion of the 
(unknown) cluster three and four point function contributions to the errors. It does include the (negligible) 10% 
scatter in Y{M). 
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5. Conclusions 

Forthcoming SZ surveys such as APEX are expected to observe one or two orders of magnitude 
more clusters than currently in hand. The angular power spectrum will be an immediate result once 
clusters have been identified. There still exists uncertainty in theoretical predictions: we examined 
how those uncertainties affect the cluster power spectrum. 

We calculated the angular power spectrum for different reasonable mass functions, biases, mass- 
temperature normalizations Tf^ and gas fractions fgas and found these changes are comparable to 
changes due to cosmological parameters of interest such as ag within current ranges of interest. 
In particular we identified which modeling uncertainties mimicked changes in the cosmological 
parameters considered (by finding the relevant cosmological parameter values) and which did not. 
Some of these modeling and cosmological dependencies have been studied for the correlation function 
in two dimensions or the three dimensional power spectrum. Different subsets with different fixed 
assumptions have been considered in previous literature at different times. By combining all these 
variations in a homogeneous manner, and including several more which have been identified since, 
the relative importance of the various modeling assumptions can be more directly assessed. 

We find that progress on several fronts is needed before the scientific harvest from these exper- 
iments can be fully realized. 

The uncertainties in the mass function of clusters and the bias can both be improved with simu- 
lations. We showed the differences between many commonly used ones are significant in comparison 
to the uncertainties for the cosmological parameters of interest. 

We have summarized much of the observational and theoretical work on the Y{M) relation, 
important uncertainties still remain. The normalization of the Y(M) relation needs considerable 
observational and/or simulation input (this point has also been emphasized previously by other 
authors as referenced in the text). Current experiments such as SZA might be able to do this 
calibration when combined with other measurements for mass, as long as the normalization is not 
redshift dependent. Simulations with gas are known to have incomplete treatments of physics but 
can be used as a guide, e.g. to calibrate the effect of projection on the mass temperature relation, 
which gives a systematic increase of the Y{M) normalization of about 8%. We included scatter in 
the Y{M) relation, however this was a small effect: the angular power spectrum changed by less 
than 1% when the Y{M) relation was taken to have a scatter (as recent hydrodynamic simulations 
suggest) of 10%. The effects of mergers seem to be small, once the normalization is fixed; we argue 
that the effect of most of the parameters is just to decide whether objects are included in the survey 
or not, and mergers have the largest effect on the largest clusters, which tend to be included anyhow 
for the low values of Y^nin under consideration here. 

We also studied the experimental uncertainties which exist even in the absence of systematic 
uncertainty. We compared area to sensitivity and found that naively a shallower wider survey 
is more powerful. 
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We focused on the angular power spectrum alone, though of course complementarity is key to 
progress. Complementary data will even be provided from survey producing the power spectrum 
itself. A survey providing an angular power spectrum will also produce number counts per square 
degree and dN/dY, number counts as a function of Y. In addition, the temperature correlation 
function and perhaps dN/dz will be available. Various combinations of these quantities have been 
analyzed in the literature. Mei and Bartlett considered number counts and the angular correlation 
function, and combined it with the number counts from X-ray, for instance. This is one example 
of self-calibration (Levine, Schulz & White (2002)); adding the angular power spectrum to other 
measurements will increase the leverage of all of them. 
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and comments on the draft, and the anonymous referee for many constructive suggestions. This 
work was supported in part by DOE, NSF grant NSF-AST-0205935 and NASA grant NAG5-10842. 
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Fig. 12. — The one sigma contours using the full power spectrum and errors in equation (30), 
varying and as, marginalized with a 10% prior on Tf^ (left) and a 30% prior on Tf^ (right). 
The larger contour is for 200 square degrees (APEX) and the smaller one is for 4000 square degrees 
(SPT). The true model is our "vanilla" model, shown by the point at the center. 



